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Abstract — A factor-graph representation of quantum-mechani- 
cal probabilities is proposed. Unlike standard statistical models, 
the proposed representation uses auxiliary variables (state vari- 
ables) that are not random variables. 

I. Introduction 

Statistical models with many variables are often represented 
by factor graphs [ 1 1-[4| or similar graphical representations 
||3j-j7]. Such graphical representations can be helpful in 
various ways, including elucidation of the model itself as well 
as the derivation of algorithms for statistical inference. 

So far, however, quantum mechanics (e.g., J8|, J9}) has 
been standing apart. Despite being a statistical theory, quan- 
tum mechanics does not seem to fit into standard statistical 
categories. Indeed, it has often been emphasized that quantum 
mechanics is a generalization of probability theory that cannot 
be understood in terms of "classical" statistical modeling. 

In this paper, we propose the different perspective that the 
probabilities in quantum mechanics are quite ordinary, but 
their state-space representation is of a type not previously used 
in statistical modeling. In particular, we propose a factor-graph 
representation of quantum mechanics that correctly represents 
the joint probability distribution of any number of measure- 
ments. Like most statistical models, the proposed factor graphs 
use auxiliary variables (state variables) in addition to the 
actually observed variables; however, in contrast to standard 
statistical models, the auxiliary variables in the proposed factor 
graphs are not random variables. Nonetheless, the probabilities 
of the observations are marginals of the factor graph, as in 
standard statistical models. 

The paper is structured as follows. Section |TT] reviews 
factor graphs and their connection to linear algebra and tensor 



diagrams. Section III makes the pivotal observation that factor 
graphs with complex factors and with auxiliary variables 
that are not random variables can represent probability mass 



functions. The main results are given in Section IV 

We will use standard linear algebra notation rather than 
the bra-ket notation of quantum mechanics. The Hermitian 
transpose of a complex matrix A will be denoted by A H = A T , 
where A T is the transpose of A and A is the componentwise 
complex conjugate. An identity matrix will be denoted by I. 

II. On Factor Graphs and Matrices 

Factor graphs represent factorizations of functions of several 
variables. In this paper, all variables take values in finite 
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Fig. 2. Closing boxes in factor graphs. 

alphabets and all functions take values in C. We will use 
Forney factor graphs (also called normal factor graphs) as 
in j3J, (3) where nodes/boxes represent factors and edges 
represent variables. For example, assume that some function 
f(xi, . . . , £5) can be written as 

f(xi, ...,X 5 ) = fl(x 1 ,X 2 ,X 5 )f 2 (x 2 ,X 3 )f 3 (x 3 ,X 4: ,X 5 ). (1) 

The corresponding factor graph is shown in Fig. [T] 

The Forney factor-graph notation is intimately connected 
with the idea of "closing boxes" by summing over inter- 
nal variables |2j. For example, closing the inner dashed 
box in Fig. [2] replaces the two nodes/factors f 2 (x 2 , x 3 ) and 
f 3 (x 3 , X4, £5) by the single node/factor 



g(x 2 ,x 4: ,x 5 ) 



X3 



f2(x 2 ,x 3 )f 3 (x 3 ,x 4: ,x 5 ); 



(2) 



closing the outer dashed box in Fig. [2] replaces all nodes/ 
factors in ([l} by the single node/factor 



f(xi,x 4 ) 



(3) 



and closing first the inner dashed box and then the outer dashed 
box replaces all nodes/factors in ([T]) by 

fi{x!,x 2 ,x 5 )g(x 2 ,X4,x 5 ) = f(xi,Xi). (4) 
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Fig. 3. Factor-graph representation of matrix multiplication ^J. The small 
dot denotes the variable that indexes the rows of the corresponding matrix. 
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Fig. 5. Factor graph of the hidden Markov model J9} for n = 3. 



A A B 

Fig. 4. Factor graph of tr(A) (left) and of tr(-AB) = tr(BA) (right). 



Note the equality between (|4]l and ([3]), which holds in general: 
closing an inner box within some outer box (by summing over 
its internal variables) does not change the closed-box function 
of the outer box. 

A half edge in a factor graph is an edge that is connected 
to only one node (such as xx in Fig. [TJ. The external function 
of a factor graph (in [12|-[14| also called partition function) 
is defined to be the closed-box function of a box that contains 
all nodes and all full edges, but all half edges stick out (such 
as the outer box in Fig. [2]). The external function of Fig. [T] 
is g). 

The equality constraint function / = is defined as 

1, if Xx = ■ ■ ■ = x n 
0, otherwise. 



f=(xx 



i X n ) 



(5) 



The corresponding node (which is denoted by "=") can serve 
as a branching point in a factor graph, cf. Figs. [7||TT] 
A matrix A G C mxn m ay be viewed as a function 



{1, . . . , m} x {1, . . . , n} C : (x,y) ^ A{x, y). 



(6) 



The multiplication of two matrices A and B can then be 
written as 



(ab)(x,z) = J2 A &y) B (y,z), 



(7) 



which is the closed-box function (the external function) of 
Fig. [3] Note that the identity matrix corresponds to an equality 
constraint function f=(x,y). 

In this notation, the trace of a square matrix A is 



(8) 



which is the external function (which is a constant) of the 
factor graph in Fig. [4] Also shown in Fig. [4] is the graphical 
proof of the identity tr (AB) = tr(BA), which is much used 
in quantum mechanics. 

Factor graphs for linear algebra operations such as Fig. [3] 
and Fig. [4] (and the corresponding generalizations to tensors) 
are essentially tensor diagrams (or trace diagrams) as in 
| [T0) , (TT). This connection between factor graphs and tensor 
diagrams was noted in p2)-|[T4[. 



III. Statistical Models with Auxiliary Variables 
Using Complex Factors 

Statistical models usually contain auxiliary variables in 
addition to the observable variables. Consider, for example, 
a hidden Markov model with observables Y\ , . . . , Y„ and 
auxiliary variables (hidden variables) Xq , Xx , 
that 



Xn such 



p(yx 



) 2M, ^0, 



,x n ) =p(x )Y[p(yk,x k \x k -x). (9) 



k=X 



The factor graph of Q is given in Fig. [5] (As shown in 
this example, variables in factor graphs are often denoted 
by capital letters [2].) Closing the dashed box in Fig. [5] 
yields p(yx, ■ ■ ■ , y n ), me probability mass function of the 
observables. 

As illustrated by this example, auxiliary variables in sta- 
tistical models are often introduced in order to obtain nice 
state-space models. 

In traditional statistical models, such auxiliary state vari- 
ables are themselves random variables, and the total model 
is a joint probability law over all variables as, e.g., in (|9j. 
(A statistical model may also contain parameters in addition 
to auxiliary random variables, but such parameters are not 
relevant for the present discussion.) 

The first main point of this paper is this: requiring the 
auxiliary state variables to be random variables may be unnec- 
essarily restrictive. The benefits of state-space representations 
may be obtained by merely requiring a function f(y, x) (with 
a useful factorization) such that the probability mass function 
of the observables is 



p(y) 



f(y,x); 



(10) 



the function f(y,x) need not be a probability mass function 
and it need not even be real valued. 

For example, consider the factor graph in Fig. |6| where all 
factors are complex valued. Note that the lower dashed box 
in Fig. [6] mirrors the upper dashed box, but all factors in the 
lower box are the complex conjugates of the corresponding 
factors in the upper dashed box. The closed-box function of 
the upper dashed box is 

5(2/1,2/2,2/3)= ^2 9x{xx,yx)92{xx,X2,y 2 )g 3 {x2,yz) (11) 

and the closed-box function of the lower dashed box is 

9i K, 2/i) 92 (x'x, x' 2 , y 2 ) g 3 (x' 2 ,y 3 ) = 5(2/1,2/2,2/3)- 

x l -> x 2 

(12) 
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Fig. 6. Factor graph for J/2, ?/3) with complex-valued factors. 



If follows that the closed-box function in Fig. [6] (with both 
dashed boxes closed) is 

5(2/1,2/2,2/3)3(2/1,2/2,1/3) = |ff(2/i,2/2,2/3)| 2 , (13) 

which is real and nonnegative and thus suitable to represent a 
probability mass function p(2/i, 2/2, 2/3) ( U P to a scale factor). 

We will see that factor graphs as in Fig. [6] — with two parts, 
one part being the complex conjugate mirror image of the 
other part — can represent probabilities in quantum mechanics. 

IV. Factor Graphs for Quantum Mechanics 

Consider the factor graph of Fig. [7J In this figure, Uq and U\ 
are M x M unitary matrices, and all variables except Y\ and 
Y 2 take values in the set {1, . . . , M}. The two large boxes in 
the figure represent measurements, as will be detailed below. 
The factor/box p(xo) is a probability mass function over the 
initial state Xo. We will see that this factor graph (with suitable 
modeling of the measurements) represents the joint probability 
mass function p(yi, y 2 ) of a general M-dimensional quantum 
system with two observations Y\ and Y 2 . The generalization 
to more observed variables Y\, Y2, . . . is obvious. 

The unitary matrices Uq and U\ in Fig. [7] represent the 
development of the system between the initial state and the 
first measurement, or between measurements, respectively, 
according to the Schrodinger equation. 

In the most basic case, the initial state Xq — xq is known 
and the measurements look as shown in Fig. [8] where the 
matrices Bi and B 2 are also unitary. In this case, the observed 
variables Y\ and Y 2 take values in {1, . . . , M} as well. Note 
that the lower part of this factor graph is the complex conjugate 
mirror of the upper part (as in Fig. |6j. 

In quantum-mechanical terms, measurements as in Fig.|8]are 
projection measurements with one-dimensional eigenspaces. 
Note that the value of Y\ and Y 2 is the index of the measured 
eigenspace (rather than the corresponding eigenvalue). 



A very general form of measurement is shown in Fig. 11 
In this case, the range of Yk is a finite set 34, and for each 
2/ € 34, the factor Ak(xk,Xk,y) corresponds to a complex 
square matrix Ak{y) (with row index Xk and column index 
Xk) such that 

Y J My) H Mv) = i (14) 

y&Jk 



(see (9] Chap. 2]). Measurements as in Fig. [8] are included as 
a special case with 34 = {1, ■ • ■ , M} and 



A k (y) = Bk(y)B k (y) H , 



(15) 



where Bk(y) denotes the y-th column of Bf.. 

It is clear from Section [III] that the external function of 
Fig. [JJ (with measurements as in Fig. [8] or as in Fig. 1 1 



is real and nonnegative. We now proceed to analyze these 
factor graphs and to verify that they yield the correct quantum- 
mechanical probabilities p(yi,y 2 ) for the respective class of 
measurements. To this end, we need to understand the closed- 
box functions of the dashed boxes in Fig. [9] We begin with 
the dashed box on the right-hand side of Fig. [9] where Y 2 is 
assumed to be unknown. 

Proposition 1. Closing the dashed box on the right-hand side 
in Fig. [9] (with a measurement of Y 2 as in Fig. [8] or as in 
Fig. [TT] but with unknown result of the measurement) reduces 
it to an equality constraint function. □ 

The proof of this proposition and the proofs of the subsequent 
propositions are easy, and are omitted due to space constraints. 

Proposition [JJ guarantees, in particular, that a future mea- 
surement (with as yet unknown results) does not influence 
present or past observations. 

The proposition clearly holds also for the extension of Fig. [7] 
to any finite number of measurements Y\ , Y 2 , . . . and can then 
be applied recursively from right to left. 

Applying reductions according to Proposition [JJ recursively 
from right to left in Fig. [7] leads to the following proposition. 

Proposition 2. The factor graph of Fig. [7] (with measurements 
as in Fig. [8] or as in Fig. [TT]) represents a properly normalized 
probability mass function, i.e., the external function p(yi,y 2 ) 
is real and J2 yi , V2 Pivi, 2/2) = 1- □ 

Consider now the dashed box on the left in Figs. [9] and [TO] 
which turns out to be the density matrix of quantum me- 
chanics. We will distinguish between the closed-box function 
pk(xk, x' k ) before measuring Yk (as in Fig.[9ji, and the closed- 
box function f>k(xk,x' k ) after the observation Yk — yk (as 
in Fig. [TO) . The former is easily seen to be properly nor- 
malized, but the latter needs normalization to satisfy (I61. 



The corresponding matrices will be denoted by pk and pk, 
respectively. The proper normalization can then be expressed 
by the condition 



tr(p fc ) = tr(p fe ) = 1. 



(16) 



Proposition 3 (Unitary Evolution Between Measurements). 

The matrix pk+i is obtained from the matrix pk as 



p k+ i = VkPkUk- 



(17) 

□ 



Proposition 4 (Basic Projection Measurement). In Fig. |7] 
(generalized to any number of observations), if Yk is measured 
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Fig. 7. Factor graph of quantum system with two measurements and the corresponding observations Y\ and Y2. 
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Fig. 8. Important special case of Fig. [T] all matrices are unitary and the initial state Xo = xo is known. In quantum-mechanical terms, such measurements 
are projection measurements with one-dimensional eigenspaces. 
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Fig. 9. The closed-box function of the dashed box on the left is the density matrix pi (21, x'j). If Y2 is not known, the dashed box on the right reduces to 
the constraint X\ = X[. 



OC pi 



X 



p(xo) 



■Q- 



x[ 



X, 



Xo 



X'i 



x' 2 



Xo 



XL 



Y 



Fig. 10. Density matrix p\ after measuring Y\ = yi- 
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Fig. 11. General measurement as in j9] Chap. 2]. Condition \14) must be 
satisfied. 



as in Fig. [8] then 

P(Y k = y\Y k _ 1 =y k ^,...,Y 1 = y 1 ) 

= B k {y) H p k B k {y) (18) 
= ti(B k (y)B k (y) H p k ), (19) 

where B k (y) is the y-th column of B k . After measur- 
ing/observing Y k = y, the density matrix is 



~ Pk =B k {y)B k {y)". 



(20) 

□ 



Note that d20b is properly normalized because 

tr{B k {y)B k {yf)= tr(B k (y)» B k (y)) = \\B k (y)\\ 2 = 1. 

In the special case of Fig. [HJ with known initial state 
Xq = xq, the matrix p k factors as 



p k (x k ,x' k ) = ip k (x k )ip k (x' k ), 
or, in matrix notation, 

Pk = ^feV'fc) 



(21) 



(22) 



where ip k is a column vector of norm 1 . The post-measurement 
density matrix p k factors analoguously, as is obvious from 
<\20\ . In quantum-mechanical terms, ip k is the quantum state. 
The probability ( p"8| ) can then be expressed as 

P{Y k =y\Y k _ l =y k _ u ...,Y l = y l ) 

= B k {yf^B k {y) (23) 
= \\B k {y) H i> k f, (24) 

which is the most basic form of computing probabilities in 
quantum mechanics. 

Proposition 5 (General Measurement). In Fig. [JJ (general- 
ized to any number of observations), if Y k is measured as in 



Fig. 11 then 

P{Y k =y\Y k - 1 =y k - 1 ,...,Y 1 = y 1 ) 

= tr(A k (y)p k A k (y) H ). (25) 

After measuring/observing Y k = y, the density matrix is 

A k (y)p k A k (y) H 



p k 



tr(A k (y) Pk A k (y) H ) 



(26) 

□ 



According to Propositions [2J43J the factor graph of Fig. [7] 
(with measurements as in Fig . [8] or as in Fig. 1 1 1 yields indeed 
the correct quantum-mechanical probabilities for the respective 
class of measurements. 

V. Conclusion 

We have proposed a class of factor graphs that represent 
quantum-mechanical probabilities involving any number of 
measurements, both for basic projection measurements and for 
general measurements as in |9| Chap. 2]. Such factor graphs 
have not previously been used in statistical modeling. 

The space constraints of this paper preclude the discussion 
of further pertinent topics that we intend to address elsewhere, 
including the meaning of such factor graphs from a statistical- 
modeling point of view (disregarding physics), the relation to 
quantum Bayesian networks (see, e.g., (13)), to quantum belief 
propagation (see, e.g., fl6)), and to tensor diagrams/networks 
for analyzing quantum systems (see, e.g., fT7)). It is also 
noteworthy that quantum circuits as in (9] Chap. 4] may be 
viewed as halves of factor graphs as in this paper, where the 
missing other half is a complex conjugate mirror image as in 
Section gill 
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